NSF PAR Search | NSF Public Access Repository

All for One: LLMs Solve Mental Math at the Last Token With Information Transferred From Other Tokens

https://doi.org/10.18653/v1/2025.emnlp-main.1565

Mamidanna, Siddarth; Rai, Daking; Yao, Ziyu; Zhou, Yilun (January 2025, Association for Computational Linguistics)

Full Text Available
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

Rai, Daking; Zhou, Yilun; Feng, Shi; Saparov, Abulhair; Yao, Ziyu (July 2024, arXiv)

Full Text Available
IntelliExplain: Enhancing Conversational Code Generation for Non-Professional Programmers

Yan, Hao; Latoza, Thomas D; Yao, Ziyu (May 2024, arXiv)

Full Text Available
An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs

https://doi.org/10.18653/v1/2024.acl-long.387

Rai, Daking; Yao, Ziyu (January 2024, Association for Computational Linguistics)

Full Text Available
Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance

https://doi.org/10.18653/v1/2024.findings-acl.371

Srivastava, Saurabh; Huang, Chengyue; Fan, Weiguo; Yao, Ziyu (January 2024, Association for Computational Linguistics)

Full Text Available
Synthetic Question Value Estimation for Domain Adaptation of Question Answering

https://doi.org/10.18653/v1/2022.acl-long.95

Yue, Xiang; Yao, Ziyu; Sun, Huan (May 2022, ACL 2022)

Full Text Available
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering

https://doi.org/10.1109/BIBM52615.2021.9669300

Yue, Xiang; Zhang, Xinliang; Yao, Ziyu; Lin, Simon; Sun, Huan (December 2021, 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM))

Clinical question answering (QA) aims to automatically answer questions from medical professionals based on clinical texts. Studies show that neural QA models trained on one corpus may not generalize well to new clinical texts from a different institute or a different patient group, where large-scale QA pairs are not readily available for model retraining. To address this challenge, we propose a simple yet effective framework, CliniQG4QA, which leverages question generation (QG) to synthesize QA pairs on new clinical contexts and boosts QA models without requiring manual annotations. In order to generate diverse types of questions that are essential for training QA models, we further introduce a seq2seq-based question phrase prediction (QPP) module that can be used together with most existing QG models to diversify the generation. Our comprehensive experiment results show that the QA corpus generated by our framework can improve QA models on the new contexts (up to 8% absolute gain in terms of Exact Match), and that the QPP module plays a crucial role in achieving the gain.
Full Text Available
Learning Structural Edits via Incremental Tree Transformations

Yao, Ziyu; Xu, Frank; Yin, Pengcheng; Sun, Huan; Neubig, Graham (January 2021, The Ninth International Conference on Learning Representations 2021 (ICLR'21))
Full Text Available
An Imitation Game for Learning Semantic Parsers from User Interaction

Yao, Ziyu; Tang, Yiqi; Yih, Wen-tau; Sun, Huan; Su, Yu (January 2020, 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP'20, long))
Full Text Available
